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1.  Introduction 


New  advances  in  technology  have  led  to  the  growth  of  advanced  detection  techniques  in 
molecular  biology.  This  rise  in  turn  has  led  to  the  availability  of  large  amounts  of  biological 
data.  There  has  been  a  growing  need  to  model  biological  systems  to  further  understand 
biological  processes,  and  facilitate  the  development  of  new  techniques  for  medication  and  drug 
delivery  (i). 

Genetic  Regulatory  Networks  (GRNs)  model  the  relationships  between  the  activation  and 
deactivation  of  genes.  They  represent  the  signal  transduction  of  proteins,  as  proteins  indirectly 
and  directly  interact  to  regulate  the  expression  of  certain  genes.  GRNs  are  responsible  for  many 
of  the  most  fundamental  biological  processes,  including  T-cell  production,  mitosis,  protein 
synthesis,  metabolism,  etc.  (2). 

Modeling  GRNs  is  important  to  facilitate  medication  and  drug  delivery.  Researchers  can  target 
specific  genes  or  proteins  in  signal  transduction  pathways  to  cure  a  number  of  diseases  and 
disorders,  with  applications  in  improving  the  immune  response  and  treating  cancer  (2). 

However,  this  requires  a  detailed  knowledge  about  the  effects  of  activating  or  deactivating 
multiple  genes  in  the  network.  Statistical  and  logical  models  are  natural  choices  to  represent  the 
complex  relationships  between  genes  and  proteins  as  a  signal  propagates  through  the  pathway. 
Furthermore,  modeling  techniques  must  be  flexible  and  provide  a  basis  to  conduct  inference  on 
the  activation  of  certain  genes. 

However,  it  is  difficult  to  develop  models  that  can  accurately  describe  the  relationships  between 
genes.  Furthermore,  depending  on  the  gene  expression  data,  a  gene  may  not  always  activate 
given  certain  conditions  as  the  activation  of  a  gene  is  stochastic  in  nature.  Therefore,  to  develop 
such  complex  processes  one  can  use  the  paradigm  of  probabilistic  logical  programming. 
Probabilistic  logical  programming  allows  users  to  form  connections  between  otherwise 
independent  random  variables  and  develop  logical  relationships.  Furthermore,  it  provides  a 
simple  but  powerful  framework  to  implement  logical  and  statistical  techniques  (2).  Specifically 
in  the  case  of  GRNs,  probabilistic  logical  programming  allows  one  to  implement  and  facilitate 
the  structural  and  parameter  learning  of  Bayesian  networks,  which  can  provide  an  accurate  and 
flexible  means  of  modeling  GRNs  from  real  gene  expression  data  (4). 

Proteins  are  one  of  the  most  important  biochemical  compounds  in  living  organisms.  Synthesized 
from  genes,  proteins  are  composed  of  20  naturally  occurring  amino  acids  held  together  in 
polypeptide  bonds.  Proteins  serve  in  a  number  of  important  biological  processes  including 
metabolism,  where  they  serve  as  enzymes  for  catalyzing  reactions,  the  immune  response,  where 
antibodies  target  foreign  compounds  for  destruction,  and  in  cell  signaling  (5). 
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Two  major  types  of  proteins,  extracellular  and  trans-membrane,  participate  in  cell  signaling. 
Trans-membrane  proteins  span  the  cell  membrane  and  serve  as  receptors  for  signaling  molecules. 
Extracellular  proteins  carry  or  transmit  signals  across  cells.  When  an  extracellular  protein  hits  a 
receptor  on  a  cell,  it  induces  a  biochemical  response  inside  the  cell.  This  response  propagates 
through  signal  transduction  pathways,  eventually  leading  to  the  activation  of  a  gene  (6). 

It  is  important  to  recognize  that  each  protein  corresponds  to  a  certain  gene  from  which  it  is 
synthesized.  These  genes  have  different  “expression  levels”  that  can  be  quantified  in  terms  of  its 
concentration.  There  are  a  few  major  techniques  to  measure  gene  expression  levels.  The  first  is 
through  microarrays,  where  microscopic  deoxy  ribonucleic  acid  (DNA)  probes  are  placed  on  a 
slide.  When  another  slide  of  tissue  containing  messenger  ribooxy  nucleic  acid  (mRNA)  comes 
into  contact  with  the  slides,  the  mRNA  hybridizes  with  its  complementary  probe,  allowing  one  to 
measure  the  mRNA  levels.  A  significant  assumption  in  using  these  measurements  is  that  mRNA 
levels  are  directly  proportional  to  gene  expression.  Another  method  to  measure  expression  levels 
is  to  use  flow  cytometry,  which  uses  the  principles  of  light  scattering  and  light  excitation  to 
accurately  measure  absolute  protein  concentrations,  as  well  as  other  properties  (4).  The 
assumption  is  made  that  protein  activity  is  proportional  to  its  concentration.  Data  from 
experiments  for  established  GRNs  are  readily  available  from  sources  such  as  National  Institute 
of  Health. 

An  important  aspect  of  GRNs  is  that  the  expression  levels  of  one  gene  may  be  related  to  the 
expression  levels  of  another  gene,  either  indirectly  or  directly  through  the  interactions  of  their 
corresponding  proteins.  When  this  happens,  the  gene  is  said  to  “regulate”  the  other  gene  (i). 

For  example,  when  a  cell  receives  a  signal  through  a  receptor  on  its  membrane,  it  begins  the 
process  of  gene  activation  and  protein  synthesis.  A  certain  gene  is  transcribed  into  mRNA, 
translated  into  amino  acids,  and  finally  folded  into  a  structure  forming  a  protein.  This  protein 
then  goes  off  to  carry  out  other  functions,  which  may  then  trigger  the  activation  of  another  gene 
through  cell  signaling  (d). 

The  immune  response  can  be  modeled  as  a  series  of  activations  or  deactivations  of  certain 
proteins.  The  immune  system  functions  by  first  monitoring  the  body  for  any  pathogens  or 
foreign  material  (d).  If  detected,  the  immune  system  sends  a  signal  from  the  source  to  other  cells 
to  combat  the  pathogen.  This  signal  causes  the  activation  of  a  gene,  which  may  then  continue  to 
activate  another  gene,  and  so  on.  Such  interactions  between  genes  or  proteins  are  known  as 
GRNs  and  more  traditionally  known  as  signal  transduction  pathways  (i). 
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2.  MAPK/Raf  Pathway 


The  pathway  that  has  been  analyzed  in  this  research  is  the  MAPK/Raf  pathway.  This  pathway 
aids  in  human  cell  division  or  mitosis,  and  carries  a  signal  from  outside  the  cell  to  induce  protein 
synthesis  and  mitosis  inside  the  cell.  The  pathway  functions  when  some  extracellular  protein, 
known  as  a  mitogen,  hits  a  receptor  on  the  cell  membrane.  This  causes  a  bio-chemical  response 
in  the  cell  that  activates  certain  proteins,  such  as  PKC  and  PKA.  The  signal  propagates  through 
the  pathway,  hitting  important  proteins  such  as  P38,  INK,  Raf,  and  Mek.  The  signal  eventually 
reaches  Erk,  which  activates  a  transcription  factor  that  activates  a  gene,  eventually  leading 
mitosis  (7). 


3.  Bayesian  Network 


Given  biological  data,  such  as  microarray  and  flow  cytometry  data,  the  problem  is  to  develop  a 
model  that  can  represent  the  GRN  as  closely  as  possible.  Basically,  a  model  needs  to  be 
generated  from  the  real  data  that  can  both  describe  the  underlying  structure  of  the  network  and 
show  the  effects  of  certain  proteins  activation  on  the  entire  network.  As  the  activation  of  a 
certain  protein  is  probabilistic  in  nature,  the  activation  of  a  certain  gene  may  not  always  activate 
a  protein  downstream.  The  uncertainty  in  the  pathway  given  the  statistical  nature  of  the  data  has 
to  be  captured. 

The  interactions  between  these  genes  can  be  modeled  graphically  through  a  network  where  the 
nodes  correspond  to  gene  (or  protein)  names  and  the  edges  indicate  relationships.  More 
specifically,  a  type  of  network  that  can  be  used  to  model  these  complex  relationships  is  the 
Bayesian  network  (4).  Formally,  a  Bayesian  network  is  a  probabilistic  directed  acyclic  graph 
(DAG)  where  the  nodes  represent  random  variables  and  the  edges  represent  conditional 
dependencies.  Each  node  contains  a  probability  distribution  that  gives  the  conditional 
probability  of  that  node  activating  depending  on  its  parents.  Although  the  relationships  are 
statistical  in  nature,  they  can  still  be  used  to  indicate  causality  (2). 

In  the  case  of  GRNs,  a  Bayesian  network  can  be  developed  where  the  nodes  correspond  to  the 
genes  (or  proteins)  and  the  edges  represent  conditional  dependencies.  The  probability 
distribution  at  each  node  would  represent  the  probability  of  that  protein  activating  given  the 
activation/deactivation  of  its  parents. 

Bayesian  networks  are  attractive  structures  to  use  as  they  can  easily  model  complex  relationships 
and  find  the  posterior  probabilities  given  the  activation  of  certain  nodes.  Furthermore,  Bayesian 
networks  appeal  to  the  naturally  probabilistic  nature  of  signal  transduction.  A  protein,  at  least 
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from  a  high-level  statistical  standpoint,  may  not  always  activate  depending  on  certain  conditions. 
By  representing  the  uncertainty  in  activation,  Bayesian  networks  are  more  flexible  than  other 
statistical  analysis  techniques. 


4.  Methods 


The  main  learning  problem  can  then  be  stated  as  creating  a  Bayesian  network  that  models 
genetic  regulatory  networks  from  microarray  or  flow  cytometry  data.  This  problem  essentially 
reduces  to  two  sub-problems,  which  are  (1)  parameter  learning  and  (2)  structure  learning. 
Parameter  learning  consists  of  finding  the  correct  probability  distribution  for  each  node  in  the 
network.  Structural  learning,  the  more  complicated  of  the  two,  is  learning  the  actual  structure  or 
which  edges  connect  to  which  nodes.  For  structural  learning,  each  network  requires  a  score  for 
comparison.  Since  it  is  computationally  intractable  to  compare  each  possible  network,  it  is 
appealing  to  use  artificial  intelligence  techniques  to  generate  the  structure. 

There  are  a  few  issues  in  the  computations  required  for  a  Bayesian  network.  First,  the  data  might 
be  incomplete  with  hidden  or  latent  variables.  Second,  finding  the  likelihood  exactly  for  the 
network  is  impossible  in  most  cases,  since  the  integral  that  arises  is  usually  intractable.  The 
method  that  can  be  used  to  solve  this  problem  is  known  as  Variational  Bayes -Expectation 
Maximization  (VB-EM),  which  can  provide  an  approximation  to  the  marginal  likelihood  for  the 
network  by  using  the  mean  field  approximation.  Rather  than  use  a  point  estimate  for  the 
parameters.  Variational  Bayes  treats  both  the  parameters  and  any  hidden  variables  as  unknowns 
from  a  Bayesian  standpoint.  In  the  process.  Variational  Bayes  also  obtains  a  set  of  optimal 
parameters  (5).  The  estimation  to  the  marginal  likelihood,  also  known  as  the  (negative)  free 
energy,  can  be  used  as  a  score  for  model  selection.  This  technique  is  different  from  other 
methods  such  as  ME  (Maximum  Eikelihood)  and  MAP  (Maximum  a  posteriori),  which  use  point 
estimates  for  the  parameters.  Monte  Carlo  methods  are  able  to  take  random  samples  and 
evaluate  integrals  at  different  values  of  the  parameter,  but  are  computationally  intensive  (2). 

Since  the  Variational  Bayes  method  can  be  implemented  iteratively  it  is  a  method  of  choice  for 
use  in  learning  problems  (5). 

A  derivation  of  Variational  Bayes  based  on  Sato  et  al.  (9)  and  Beal  (8)  is  given  in  equation  1.  It 
is  of  interest  to  find  some  posterior  probability,  p(MID),  where  M  is  the  model  and  D  is  the 
observed  data.  Bayes  theorem  says 

p(MID)ocp(DIM)p(M).  (1) 


4 


So,  we  must  then  compute  this  likelihood,  p(DIM).  However,  first  the  model  is  parameterized  by 
some  parameters  0.  Second,  there  may  be  hidden  or  latent  variables  Z  that  have  not  been 
observed.  Then,  p(DIM)  can  be  expressed  as 

p(DIM)  =  z,9\M)de.  (2) 

0 

We  must  somehow  evaluate  this  expression  to  compute  the  marginal  likelihood.  However,  the 
above  integral  is  intractable.  So,  instead  of  considering  this  expression,  we  can  rather  consider 
the  log  of  this  expression,  L(D): 

L(D)  =  log  2^  J  p{D,  z,0\M)de.  (3) 

© 

We  can  then  re-express  this  as 

L(D)  =  log X,  J PiD, z,e\M)de  =  log  Y,^\q{z,e\  dO  ,  (4) 

0  0  I  LJ^iVl  ) 


where  q(.)  is  the  distribution  of  the  hidden  variables  and  the  parameters  given  the  data  and 
model. 


Finally,  using  this  form  of  the  equation,  we  can  then  make  an  appeal  to  Jensen’s  inequality: 

p{D,z,e\M) 


L(D)  =  log  J  P(D,  z,9\M)d9  =  \ogYj  diz,  9\D,M) 


q(z,9\D,M) 


-d6 


> 


2]J<?(z,6'ID,M)log 


p{D,z,e\M) 

q{z,e\D,M) 


(5) 


dO  . 


Jensen’s  inequality  allows  to  put  a  lower  bound  on  the  marginal  likelihood.  The  right  side  of  this 
inequality  is  known  as  F(q),  the  (negative)  variational  free  energy: 


Fid)  =  Yz\d{z,e)\og 


p(D,z,9\M) 

q{z,0) 


de. 


(6) 


Also,  if  we  subtract  L(D)-F(q): 

L(D)-F(q)  =  Y  \q(z,0)log—^^^^^d0  =  KL(q\\p),  (7) 

p(z,9\D,M) 


we  arrive  at  the  KL  divergence,  where  KL(qllp)  is  the  KL  divergence  between  q(z,6)  and 
p{z,e\D,M). 

This  means  that  maximizing  F(q)  is  equivalent  to  minimizing  the  KL  divergence  between  the 
distributions.  As  such,  the  entire  procedure  of  Variational  Bayes  can  then  be  thought  of  as 
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maximizing  F(q),  this  lower  bound  to  the  marginal  likelihood,  to  minimize  the  KL  divergence  as 
much  as  possible  to  obtain  an  accurate  approximation. 

As  stated  before,  variational  methods  typically  use  some  kind  of  approximation.  In  this  case,  we 
can  use  the  mean-field  approximation 

q(z,d]  ^  q(z]q(d]. 

If  we  plug  in  this  approximation,  we  arrive  at 


L(D)  =  log^J  p(D,z,d\M)dd  =  iogY,Jq(z,d) 


p(D,z,d\M) 

q{z,0) 


de 


> 


p{D,z,e\M) 


Yj^\qiz)q{e)\og 


qiz,0) 

p{D,z,0\M) 


de 


(8) 


qi.z)q{.e) 


de . 


Plugging  in  this  approximation  then  allows  us  to  take  the  functional  derivatives,  letting  us 
maximize  the  expression 

q{z)coQ^p{^q{e)\ogpiD,z\e,M)de) , 

© 

^(6')oop(6>IM)exp(2^^^(z)log/?(D,z  1 6>,M)) .  (9) 


This  leads  to  two  coupled  iterative  equations,  which  we  can  loop  over  until  they  converge. 

When  they  converge,  we  will  have  maximized  our  lower  bound,  F(q),  which  can  then  be  used  as 
a  score  for  model  selection. 


5.  Structural  Learning 


To  address  the  problem  of  structural  learning,  we  can  use  a  greedy  algorithm  in  place  of  an 
exhaustive  search  to  find  the  optimal  network.  Specifically,  a  hill-climbing  algorithm  that 
conducts  a  local  search  using  a  heuristic  can  be  implemented  (2).  Basically,  we  can  generate  a 
random  network  to  represent  an  initial  state.  After  that,  we  can  conduct  three  local  operations  on 
this  network,  which  include  adding,  deleting,  or  reversing  an  arc.  After  each  operation,  the  score 
of  the  network  is  determined  corresponding  to  a  fitness  function.  The  score  that  can  be  used  is 
actually  the  (negative)  variational  free  energy,  which  we  obtained  from  the  Variational  Bayes 
procedure  itself.  When  the  network  with  the  highest  score  after  the  local  operations  is 
determined,  the  hill-climbing  algorithm  moves  to  this  network  and  sets  this  as  the  current  state, 
repeating  the  local  search.  This  process  is  repeated  until  the  convergence. 
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Hill  climbing,  as  opposed  to  other  greedy  algorithms,  does  not  retain  any  information  about  its 
previous  states.  To  address  this  issue,  so  that  the  algorithm  does  not  get  stuck  on  a  local 
maximum,  the  algorithm  can  be  repeated  or  rerun  with  other  random  graphs  as  the  initial  states. 
After  that,  all  the  generated  networks  can  be  model  averaged,  where  high  confidence  arcs  from 
the  networks  can  be  selected  and  included  in  the  final  network. 


6.  Probabilistic  Logical  Programming 


To  process  data  that  is  incomplete  and  uncertain,  the  framework  of  probabilistic  logical 
programming  can  be  used  to  obtain  meaningful  relations.  It  is  a  framework  that  combines  three 
different  paradigms,  namely  first-order  logic,  statistical  learning,  and  probability.  Logical 
programming  is  the  process  of  using  first  order  logic  in  the  form  of  declarative  statements  or 
clauses  to  describe  relationships  between  entities  and  infer  their  consequences.  In  logical 
programming,  facts  are  represented  as  propositions.  The  goal  or  solution  can  be  obtained 
starting  with  the  negation  of  a  proposition  to  be  proved  and  deriving  a  contradiction  by 
systematic  deduction  using  the  facts.  Recently,  there  has  been  interest  in  extending  logical 
programming  to  include  probabilities  figure  1 .  This  approach  can  allow  for  more  realistic 
modeling  capabilities  by  taking  into  account  the  uncertainties  in  the  data  obtained  in  real  world 
that  has  significant  applications  in  machine  learning  (3). 


Lcaniiiiij 


Figure  1 .  Schematics  of  probabilistic  logic  paradigm. 

PRISM  is  a  probabilistic  logical  framework  based  on  B-prolog  the  language  extends  the  Horn 
clauses  to  include  random  variables.  The  semantics  of  the  programming  language  is  based  on 
distribution  semantics  defined  on  DB  =  {F}  U  {R},  where  {F}  is  the  set  of  atoms  with  random 
variables,  called  msw  atoms,  {R}  is  a  set  of  definite  clauses  whose  head  atoms  are  not  msw 
atoms,  and  DB  is  a  PRISM  program.  The  distribution  semantics  consists  of  defining  a 
probability  measure  Pf(.)  first  on  the  Herbrand  models  containing  the  msw  atoms.  Then  by 
Kolmogorov’s  extension  theorem  the  measure  is  extended  to  Pdb(-)  the  product  space  of  {F}  and 
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{R}.  In  practice,  PRISM  allows  users  to  easily  specify  discrete  probability  distributions  through 
its  syntax.  Furthermore,  it  supports  the  use  of  conditional  probabilities,  providing  the 
mechanisms  needed  to  model  Bayesian  networks.  Most  importantly,  it  has  support  and 
implementations  of  some  of  the  most  popular  and  complex  statistical  learning  algorithms, 
including  ML,  MAP,  and  Variational  Bayes  (9). 

To  program  a  Bayesian  network  in  PRISM,  one  has  to  generate  what  is  known  as  a  “probabilistic 
clause.”  Basically,  this  is  a  clause  that  takes  a  random  variable  and  generates  a  probabilistic  call 
on  the  variables  (9).  To  make  the  example  clearer,  consider  a  simple  probabilistic  model  that 
contains  three  binary  random  variables  A,  B,  and  C.  To  specify  the  binary  distribution  in 
prism’s  syntax,  one  can  simply  use  PRISM’s  random  “switch”  declaration:  val  ues  (_,  [0,1]). 

Next,  to  model  the  three  random  variables  a  probabilistic  call  must  be  made  to  each  of  the 
variables.  This  is  done  through  PRISM’s  version  of  the  switch  statement,  known  as  “msw”  (9). 

A  simple  probabilistic  clause  named  “world”  that  considers  the  three  variables  would  be: 

world(A,B,C):-  msw(a,A),msw(b,B),msw(c,C). 

Now  that  a  probabilistic  model  can  be  created,  generating  a  Bayesian  network  simply  requires 
one  addition:  conditional  probabilities.  This  is  easily  implemented  using  the  prolog/PRISM 
complex  functor  syntax,  which  is  a  term  with  parenthesis  containing  another  term  (9). 

So  consider  a  simple  Bayesian  network  with  three  binary  random  variables.  A,  B,  and  C,  and  let 
us  say  the  graph  consists  of  an  edge  from  A  to  C  and  an  edge  from  B  to  C.  A  probabilistic 
predicate  to  model  this  would  be: 

world(A,B,C):-msw(a,A),msw(b,B),msw(c(A,B),c). 

The  project  consists  of  generating  such  a  probabilistic  clause,  a  representation  of  a  Bayesian 
network,  which  best  models  the  real  gene  expression  data  (4).  With  this  clause,  one  can  easily 
use  PRISM  to  conduct  inference  from  this  model. 


7.  Experiment 


Flow  cytometry  data  was  obtained  for  the  well-established  “Raf  ’  pathway  in  a  discretized  format 
into  three  levels  using  one-dimensional  K-means. 

The  random  generation  method  was  implemented  where  two  nodes  are  randomly  selected  and  an 
edge  is  added  on  the  condition  that  the  graph  remains  acyclic.  This  process  is  repeated  and  the 
algorithm  outputs  a  random  graph  that  can  be  used  as  the  initial  state. 
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After  that,  the  hill-climbing  algorithm  was  implemented,  where  a  local  search  was  performed  on 
the  graph.  The  three  operations,  which  are  add,  delete,  and  reverse,  were  conducted  on  pairs  of 
nodes  to  generate  the  new  graphs  that  comprised  the  local  search  space.  The  acyclic  condition 
was  checked  for  each  of  these  three  operations  to  generate  failure-free  clauses. 

For  each  operation,  the  variational  free  energy  was  calculated  to  be  used  as  a  score.  During  the 
course  of  the  algorithm,  the  graph  was  stored  as  a  list  of  edges,  i.e.,  [edge(pKC,pKA), 
edge(raf,mek)] .  However,  to  obtain  the  free  energy,  the  graph  was  then  converted  to  a  PRISM 
program  model  to  take  advantage  of  PRISM ’s  learning  system.  A  probabilistic  predicate,  known 
as  “world,”  was  generated  based  on  the  graph  edges  and  then  saved  to  a  .psm  fde,  which  was 
then  loaded  into  the  database.  The  system  was  then  set  up  for  VM-EM  learning  through 
prism’s  syntax,  by  calling  learning  on  the  hyper-parameters  of  a  Dirichlet  distribution. 

Learning  was  begun  and  the  free  energy  was  obtained  by  calling  “leaming  statistics.” 

The  graph  with  the  highest  score  after  the  local  operations  was  selected  to  be  the  next  state.  This 
algorithm  continued  until  the  graph  reached  convergence. 

This  process  was  repeated  on  500  random  graphs  to  avoid  getting  stuck  on  local  maxima.  To 
speed  up  the  program,  a  multi-threaded  version  was  implemented  using  Java,  where  a  call  was 
made  to  PRISM  through  batch-execution,  to  run  the  hill-climbing  algorithm  on  each  random 
graph  in  a  separate  thread.  The  threads  were  run  concurrently,  causing  the  overall  time  of  the 
program  to  decrease  by  a  factor  depending  on  the  number  of  cores  in  the  computer.  It  is  possible 
to  run  this  multi-threaded  application  on  a  supercomputer  to  immensely  speed  up  the  program. 

Finally,  to  obtain  the  true  final  structure,  the  frequency  of  all  the  edges  found  in  the  500 
converged  graphs  was  recorded.  Edges  that  appeared  in  at  least  50%  of  the  structures,  or  in  this 
case  edges  that  had  a  frequency  greater  than  250  were  selected  to  be  included  in  an  averaged 
graph.  Variational  Bayes  was  run  for  a  final  time  to  obtain  the  hyper-parameters  for  this 
structure. 

To  visually  display  the  graphs,  the  Graphviz  software  was  used.  Specifically,  the  edges  in  the 
graph  structure  from  prolog  were  converted  to  Graphviz’s  “dot”  language  and  outputted  to  a  fde. 
The  command-line  dot  operator  from  Graphviz  was  called  within  PRISM  on  the  exported  file  to 
generate  a  visual  of  the  network. 

The  source  code  for  the  PRISM  and  Java  programs  is  provided  for  the  reader’s  convenience. 
Directions  on  using  the  program  are  included  in  the  readme. 
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8.  Results/Discussion 


Table  A-1  in  the  appendix  shows  the  frequency  of  the  edges  from  the  500  graphs.  Edges  that 
had  a  frequency  above  250  were  selected  to  be  included  in  the  final  averaged  model.  The  hyper¬ 
parameters  for  this  final  structure  obtained  using  Variational  Bayes  is  listed  in  the  appendix.  The 
MAPK/Raf  pathway  as  defined  by  the  literature  is  given  in  figure  2,  as  compared  to  the  final 
generated  model  given  in  figure  3. 


Activators 

1. a-CD3 

2.  u'C028 

3.  iCAM-2 

4.  PMA 

5.  p2cAMP 

Inhibitors 

6.  G06976 

7.  AKT  inh 

8.  Psitect 

9.  U0126 
10.  LY294002 


Figure  2.  MAPK  pathway  from  literature  (4). 


Figure  3.  Generated  pathway  from  algorithm. 
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The  final  structure  contained  a  total  of  22  edges.  Each  edge  and  its  accuracy  based  on  other 
models  and  literature  is  shown  in  table  1 . 

Table  1.  Accuracy  of  edges. 


Edge 

Result 

PKC->PKA 

Expected 

PKC->Raf 

Expected 

PKC->Mek 

Expected 

PKC->Erk 

Expected 

PKC->Akt 

Expected 

PKC->Plcg 

Reversed 

PKC->P38 

Expected 

PKC->Jnk 

Expected 

PKA->Raf 

Expected 

PKA->P38 

Expected 

PKA->PIP2 

Undefined/Misplaced 

PKA->Akt 

Expected 

PKA->Jnk 

Expected 

PIP3->PKC 

Misplaced 

PIP3->Plcg 

Expected 

PIP3->PIP2 

Expected 

Erk->PKA 

Reversed 

Erk->Akt 

Expected 

Mek->Raf 

Reversed 

Raf->Akt 

Expected 

P38->Jnk 

Undefined 

Plcg->PIP2 

Expected 

Mek->Erk 

Missed 

Out  of  the  22  edges,  17  are  well  defined  in  the  literature,  indicating  the  generated  model  was 
relatively  accurate.  However,  the  arc  from  PKA->Erk  was  reversed  in  the  structure.  This 
presumably  caused  the  arc  from  Mek->Erk  to  be  missed,  as  adding  this  connection  would  violate 
the  acyclic  condition.  The  arc  Mek->Raf  was  also  reversed. 

When  considering  the  phospholipids  PIP2  and  PIPS,  it  is  important  to  recognize  that  there  is  a 
feedback  mechanism  between  them.  That  is,  the  connection  between  PIP2  to  PIPS  is  direct  both 
ways  in  the  literature:  PIPS<->PIP2.  However,  the  Bayesian  network  can  only  handle  acyclic 
arcs,  so  the  arc  was  pointed  in  just  one  direction  from  PIPS->PIP2,  a  similar  result  obtained  by 
earlier  studies  (4).  Also,  this  feedback  mechanism  made  it  difficult  to  determine  where  to  place 
the  connection  between  the  phospholipids  to  PKC,  so  while  the  literature  defines  the  connection 
as  PIP2->PKC,  the  model  found  PIPS->PKC.  The  arc  between  Plcg->PKC  was  reversed  as  well. 
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Finally,  additional  arcs  were  added  between  P38->Jnk  and  PKA->PIP2,  both  of  which  are 
plausible  but  must  be  verified  through  literature  and  experimentation. 

When  comparing  the  results  with  Sachs  et  al.  (4)  (who  conducted  the  original  experiment),  the 
models  appear  very  similar.  Fourteen  out  of  seventeen  arcs  in  Sachs  et  al.  (4)  results  directly 
match  the  generated  model.  Also,  Sachs  et  al.  (4)  reported  two  not  well-defined  connections, 
PKC->PKA  and  Erk->AKT,  which  were  corroborated  by  this  generated  model.  However,  while 
Sachs  et  al.  (4)  reversed  the  connection  between  PIP3->Plcg,  the  generated  model  actually 
correctly  identified  the  direction  of  the  connection.  Furthermore,  while  Sachs  et  al.  (4) 
completely  missed  the  arcs  from  PIP2->PKC  and  Plcg->PKC,  the  generated  model  found  a 
connection  from  PIP3->PKC  and  reversed  the  connection  to  find  PKC->Plcg.  Sachs  et  al.  (4) 
also  did  not  find  the  connection  from  PIP3->Akt,  while  the  generated  model  indirectly  found  the 
connection  through  PIP3->PKC->Akt.  This  might  give  further  insight  as  to  why  the  connection 
was  added  from  PIP3->PKC  rather  than  PIP2->PKC,  since  adding  the  connection  from 
PIP3->PKC  made  the  indirect  connection  PIP3->PKC->Akt  possible. 

When  considering  the  results  as  a  whole,  the  generated  graph  was  remarkably  similar  to  the 
literature.  Only  one  arc,  Mek->Erk,  was  completely  missed  because  of  the  acyclic  condition,  and 
a  few  arcs  were  misplaced  or  reversed.  Although  this  missed  connection  between  Mek->Erk  was 
found  by  Sachs  et  al.  (4),  the  generated  model  was  successfully  able  to  identify  relationships 
between  the  phospholipids  that  Sachs  et  al.  (4)  missed  or  reversed.  This  shows  that  the  hill¬ 
climbing  method  can  be  successfully  utilized  as  compared  to  the  simulated  annealing  approach 
Sachs  et  al.  (4)  used. 


9.  Conclusions 


The  first-order  probabilistic  logic  framework  as  implemented  in  PRISM  was  successfully 
utilized  to  facilitate  both  parameter  and  structural  learning  of  Bayesian  networks  to  model  GRNs. 
This  approach  was  applied  to  the  “Raf  ’  pathway,  and  a  structure  that  is  in  good  agreement  with 
the  literature  was  generated.  Variational  Bayes  was  utilized  for  parameter  learning  and  to  obtain 
an  approximation  to  the  marginal  likelihood.  The  free  energy  was  used  as  a  fitness  function  for 
model  selection  in  structural  learning. 

The  final  structure  generated  by  the  algorithm  was  very  close,  with  only  a  few  arcs  missing  or 
reversed.  Both  direct  and  indirect  connections  between  genes  were  detected,  indicating  how 
Bayesian  networks  can  adapt  to  hidden  or  unobserved  variables.  Eurthermore,  the  algorithm  was 
able  to  identify  connections  previously  missed  by  other  research.  Such  probabilistic  reasoning 
can  be  successfully  used  on  flow  cytometry  data,  where  a  hill-climbing  approach  can  be 
implemented  to  generate  the  Bayesian  networks.  PRISM  was  also  found  to  be  a  flexible  and 
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stable  platform  that  can  be  used  to  implement  powerful  and  applicable  learning  algorithms 
through  simple  syntax. 

A  number  of  techniques  can  be  used  to  extend  or  change  the  project.  First,  a  different 
discretization  technique  can  be  applied  on  the  data  to  obtain  more  accurate  results.  Also,  results 
between  flow  cytometry  and  microarray  data  can  be  compared.  Furthermore,  the  base 
implementation  of  PRISM  only  supports  discrete  distributions  for  Bayesian  networks.  The 
framework  could  be  extended  to  include  continuous,  perhaps  linear,  Gaussian  distributions  and 
attempt  learning  using  this  broader  class  of  probability  distributions.  Most  importantly,  Bayesian 
networks  are  defined  as  DAGs,  or  contain  the  acyclic  condition.  However,  it  has  been  noted  that 
the  “Raf  ’  pathway  that  was  tested,  as  well  as  numerous  other  biological  pathways,  included  self¬ 
feedback  mechanisms.  Alternative  approaches  can  be  used  to  address  this  issue,  such  as  the 
implementation  of  dynamic  Bayesian  networks,  whose  states  are  temporal.  Given  time-series 
data  for  gene  activation,  this  approach  can  be  readily  applied.  This  provides  numerous 
opportunities  for  further  research  and  investigations. 

The  project  explored  the  applicability  of  probabilistic  logical  programming  as  a  platform  to 
analyze  data  and  generate  models  for  GRNs.  Bayesian  networks  can  be  used  to  identify 
statistical  as  well  as  causal  relations,  making  them  valuable  inference  models.  The  PRISM 
framework  along  with  prolog  is  a  flexible  language  that  can  easily  be  integrated  in  a  wide  variety 
of  “omics”  data,  including  genomics,  proteomics,  and  metabolomics.  Most  significantly,  an 
implementation  of  such  an  algorithm  can  be  used  to  analyze  new  and  old  data  to  help  identify 
connections  and  networks  that  either  have  not  been  identified  or  have  been  overlooked. 

Modeling  GRNs  using  these  methods  has  enormous  applications  in  medication  and  drug 
delivery.  Automatically  generating  probabilistic  models  from  real  data  can  help  scientists 
explore  the  effect  of  activating/inhibiting  certain  genes.  By  identifying  and  simulating  important 
proteins  and  pathways,  researchers  can  use  the  pathways  to  develop  highly  effective  medication 
and  drug  delivery  techniques.  Targeting  specific  proteins  and  transduction  pathways  can  help 
lead  to  solutions  for  a  number  of  diseases,  including  cancer,  HIV/Aids,  genetic  disorders,  etc. 
Further  research  in  Bayesian  approaches  using  probabilistic  logical  programming  can  help 
establish  methods  to  automatically  analyze  the  large  amounts  of  available  biological  data  and 
create  robust  statistical  and  probabilistic  models  to  conduct  inference. 
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Appendix.  Dirichlet  Posteriors 


PRISM  is  a  framework  that  supports  statistical  learning  of  parameters  that  would  correspond  to  a 
family  of  probability  distributions  to  model  given  data.  This  is  done  by  assuming  a  set  of  priors 
for  the  Bayesian  network  and  outputting  a  set  of  posteriors  after  processing  the  data.  At  the  end 
of  a  successful  execution  of  the  learning  paradigm  after  processing  the  evidence  data,  PRISM 
outputs  the  hyper-parameters  and  the  ML  edges  connecting  the  nodes  of  the  Bayesian  network. 
The  frequencies  of  the  edges  and  the  corresponding  hyper-parameters  from  the  successful  run  are 
listed  in  table  A-1. 
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Table  A-1.  Frequency  of  edges  from  500  random  graphs. 


Frequency 

Edge 

Frequency 

Edge 

Frequency 

Edge 

384 

PKC->Jnk 

44 

Plcg->Erk 

10 

Erk->Jnk 

383 

PKC->Raf 

38 

P38->Erk 

10 

Akt->P38 

383 

PKC->Akt 

37 

PIP3->P38 

9 

PIP2->Jnk 

355 

PKC->PKA 

37 

Jnk->PIP3 

9 

Jnk->PIP2 

347 

PKA->P38 

36 

Plcq->PKA 

8 

PIP2->Raf 

344 

PKC->Mek 

36 

Mek->Erk 

8 

PIP2->Mek 

330 

Plcq->PIP2 

33 

Plcq->P38 

7 

Raf->Plcg 

329 

Erk->PKA 

33 

Mek->PKA 

7 

Raf->Jnk 

327 

Raf->Akt 

30 

PKA->Plcg 

6 

PIP3->Jnk 

326 

PKA->Raf 

30 

Erk->Plcg 

5 

Akt->Plcg 

320 

PKC->P38 

26 

Mek->Plcg 

4 

PIP2->P38 

311 

PKC->Erk 

26 

Erk->PIP2 

3 

P38->Raf 

311 

PIP3->PIP2 

25 

Plcg->Akt 

2 

Jnk->Raf 

310 

PKA->Jnk 

25 

PIP2->PKC 

1 

Raf->P38 

308 

P38->Jnk 

23 

Plcg->Mek 

304 

PIP3->Plcg 

22 

PIP3->Akt 

286 

PKC->Plcg 

22 

Jnk->Erk 

283 

PKA->Akt 

21 

Jnk->Plcg 

276 

PKA->PIP2 

21 

Akt->Mek 

271 

Mek->Raf 

20 

PIP3->Raf 

264 

PIP3->PKC 

20 

PIP2->Erk 

255 

Erk->Akt 

19 

Mek->P38 

244 

Akt->Erk 

19 

Erk->Mek 

229 

PKC->PIP3 

18 

Erk->Raf 

228 

Raf->Mek 

17 

Raf->Erk 

216 

Akt->PKA 

17 

Mek->Jnk 

195 

Plcg->PIP3 

17 

Jnk->PKA 

191 

Jnk->P38 

16 

PKA->Mek 

188 

PIP2->PIP3 

16 

PIP2->Akt 

176 

PKC->PIP2 

16 

P38->Mek 

175 

Erk->PKC 

16 

Erk->P38 

173 

Raf->PKA 

15 

P38->Plcg 

172 

Plcq->PKC 

15 

Jnk->Mek 

170 

PKA->Erk 

15 

Jnk->Akt 

169 

PIP2->Plcg 

14 

Raf->PIP3 

129 

PKA->PKC 

14 

Plcg->Jnk 

110 

P38->PKA 

14 

PIP2->PKA 

100 

Raf->PKC 

13 

PIP3->Erk 

83 

Mek->PKC 

13 

P38->Akt 

78 

P38->PKC 

12 

Akt->PIP2 

77 

Erk->PIP3 

11 

PIP3->Mek 

70 

Akt->Raf 

11 

Mek->PIP2 

64 

PKA->PIP3 

11 

Mek->Akt 

62 

Akt->PKC 

11 

Akt->Jnk 

61 

P38->PIP3 

10 

Raf->PIP2 

59 

Mek->PIP3 

10 

Plcg->Raf 

56 

Akt->PIP3 

10 

PIP3->PKA 

51 

Jnk->PKC 

10 

P38->PIP2 

16 


Hyper-parameters  for  Final  Structure: 


switch (akt (3, 3,2,2) , unfixed,  [1,2,3],  [2 . 1 04 78 10 72 1 91722e+01 , 1 . 302 41 45 90 88 4339e+01 , 2 . 12 913 68 665510 05e+00 ] ) 
switch {akt {3, 2, 2, 2) , unfixed, [ 1 , 2 , 3 ] , [2 . 7 02 10 34 9554 17 60e+01 , 2 . 610 94 02 10 60 0353e+01 , 7 .648154374561678e-03]) 
switch {akt {3, 1,2,2) , unfixed,  [  1 , 2 , 3 ] ,  [ 8 . 904 40 02 0 6 98 98 99e+01 , 5 . 0 38 7334 61 9920 4 9e+00 , 1 . 6 61 4322 8550 9123e-02 ] ) 
switch {akt {3, 1,2, 1) , unfixed,  [1,2,3],  [ 5 . 68 611 54 57 08 72 60e-02 , 1 . 4 9574 5678 542 937e-02 , 1 . 0 08 150350 52 40 13e+01 ] ) 
switch {akt {3, 1, 1, 1) , unfixed, [1,2,3], [1. 90 922 62 34 05502 6e+01, 2. 00100 37 367958 94e+01, 5. 0 33884889 92 18 50e+00]) 
switch {akt {2, 3, 3, 3) , unfixed, [l,2,3],[9.544254268681751e-02,4.010649754500932e+00,2.063750589049270e+00]) 
switch {akt {2, 3, 3,2) , unfixed,  [1,2,3],  [  6 . 0 50 7 611 68 37 00 66e+00 , 3 . 501 14 412 91 922 12e+01 , 1 . 2 0 6 6502 84 57 62 36e+01 ] ) 
switch {akt {2, 3,2, 3) , unfixed, [1,2,3], [2. 023 651088 92 91 81e+00, 2. 300 97 452887 12 80e-02, 3. 5272559 642741 90e-02]) 
switch {akt {2, 3,2,2) , unfixed,  [1,2,3],  [1 . 65027 7 61347 64 13e+02 , 4 . 603 67 1 94310 10 45e+01 , 4 . 90 680 7508 6324 33e+01 ] ) 
switch {akt {2,2, 3, 3) , unfixed,  [1,2,3],  [ 8 . 0 31 3900 0 9854023e-02 , 1 . 0 42 94 11 56 92 4351e+00 , 2 . 7 7581 952 9 60 3951e-02 ] ) 
switch {akt {2,2, 3,2) , unfixed,  [1,2,3],  [7 . 63512 60 7828 0541e-02 , 1 . 0 01 4503 62 44 8501e+01 , 1 . 02 97507 56 9331 85e+00 ] ) 
switch {akt {2,2,2, 3) , unfixed, [1,2,3], [1. 50811444538 04 54e+01, 6 . 1 3971 7 9 93 942 639e-04 , 1 . 0 12 40 97 642 630 14e+00 ] ) 
switch {akt {2, 2, 2, 2) , unfixed,  [1,2,3],  [3. 620 35881444 38 0  9e+02, 5. 00370844800 02 58e+01,  9. 70139 93 97 950583e+01]) 
switch {akt {2, 1, 3, 1) , unfixed,  [1,2,3],  [ 7 . 587 02 78 67 54 82 7 9e-03 , 1 . 325 60 5952 552555e-02 , 1 . 1 57 0 67 9 97 4 610 93e+00 ] ) 
switch {akt {2, 1,2, 3) , unfixed, [l,2,3],[1.030104757437537e+00,4. 98 6257 948 5557 39e-02 , 8 . 7 3530 1 9 99 62 81 31e-02 ] ) 
switch {akt {2, 1,2,2) , unfixed,  [1,2,3],  [1 . 620 52 8 975 97 6645e+02 , 9 . 0 52 37 31 3356 68 1 9e+00 , 4 . 0 44 82 4724 31 30 71e+00 ] ) 
switch {akt {2, 1,2, 1) , unfixed,  [1,2,3],  [  6 . 322 1351 6353 9344e-02 , 1 . 1 44 05 61 6024 69 98e-02 , 3 . 0 04 57 6601 11 82 58e+01 ] ) 
switch {akt {2, 1, 1, 1) , unfixed, [1,2,3], [ 6 . 4 01 42 0 6 66732027e+01 , 8 . 507 32 8 9 91 5824 74e+01 , 3 . 4 02 0304 57 02 5943e+01 ] ) 
switch {akt {1, 3, 3, 3) , unfixed,  [1,2,3],  [7. 62 98874431972 81e-02, 5. 1 02241602 3987 82e+01, 4. 013531999235901e+01]) 
switch {akt {1, 3, 3,2) , unfixed,  [l,2,3],[3.7  0218  82  37  8634  37e+01,1.3  90  5817  63  968984e+02,5.90197  0134  92  0614e+01]) 
switch {akt {1, 3,2, 3) , unfixed, [l,2,3],[6.200131723214648e+01,l. 913 64 96 90 30 71 36e+01 , 1 . 1 11 0 611 50 41 3528e-02 ] ) 
switch {akt {1, 3,2,2) , unfixed,  [1,2,3],  [ 8 . 120 36 98 38 15 6221e+02 , 1 . 2 70 50 67 44 13 9521e+02 , 1 . 4 50 17 7 631 94 392 9e+02 ] ) 
switch {akt {1,2, 3, 3) , unfixed,  [1,2,3],  [2 . 7 62 72 1 948 18 42 77e-02 , 2 . 1 00 6422 4 9 662 97 9e+01 ,  6 . 0 0 60520 95 90 94 94e+00 ] ) 
switch {akt {1,2, 3,2) , unfixed,  [1,2,3],  [ 5 . 0 87 355558 00 42 1 9e+00 , 1 . 517 88 6338 4 60 945e+01 , 5 . 1 03 6318 14 11 35 93e+00 ] ) 
switch {akt {1,2,2, 3) , unfixed, [1,2,3], [ 3 . 580 37 17 37 61 1 94 6e+02 , 1 . 50 681 8532 0381 99e+01 , 4 . 221 18 1 654 7504 50e-02 ] ) 
switch {akt {1,2, 2, 2) , unfixed, [l,2,3],[1.158083907679590e+03,6.204915110131702e+01,2.801120823110234e+02]) 
switch {akt {1, 1, 3, 1) , unfixed,  [1,2,3],  [2 . 8 78 5547 88 04 62 39e-02 , 1 . 0 9310 98 10252222e-01 , 1 . 0 31 32 9958 00 6125e+00 ] ) 
switch {akt {1, 1,2, 3) , unfixed,  [1,2,3],  [8. 011  62 38 13061377e+00, 4. 30448 635650 96 62e-02, 1.6 632479 9375 90 54e-03]) 
switch {akt {1, 1,2,2) , unfixed,  [1,2,3],  [  1 . 340 00 40 12 3975 98e+02 , 4 . 0 3551 10 80 5614 04e+00 , 2 . 1 0 6812 918 32 422 9e+01 ] ) 
switch {akt {1, 1,2, 1) , unfixed,  [1,2,3],  [7 . 98358 84 00 11 3573e-02 , 3 . 94 674 37 98 642527e-02 , 6 . 7 01 91 4 6 98 95 64 55e+01 ] ) 
switch {akt {1, 1, 1, 1) , unfixed, [1,2,3], [ 1 . 0 00 302 912 47 59 63e+02 , 1 . 2 60 11 60255573 60e+02 , 3 . 8 07 53 99 69 64 12 56e+01 ] ) 
switch {raf {3,2, 3) , unfixed,  [  1 , 2 , 3 ] ,  [  6 . 653 6325202 974 48e-02 , 3 . 12 948 62 034358 72e-02 , 1 . 0 56 938302 61 02 41e+00 ] )  . 
switch {raf {3,2,2) , unfixed,  [1,2,3],  [  1 . 2 7 6 92 51 11 15 91 4 6e-01 , 2 . 551 82 69357 914 57e-02 , 1 . 0 662 6 63 64 7557 04e+00 ] )  . 
switch {raf {3, 1, 1) , unfixed,  [1,2,3],  [  9 . 1 772581 84 1 64 905e-02 , 2 . 2 02 78 83 92 6537 14e+01 , 5 . 4 04 7020 33 91 7 637e+01 ] )  . 
switch {raf {2,3,3) , unfixed, [1,2,3], [1. 013878502 91 9621e+00, 1. 0668334 75 67 0973e+00, 1.95608 97 730972 34e-01]). 
switch {raf {2,3,2) , unfixed,  [1,2,3],  [  1 . 0 1 982 40 0 67 64122e+00 , 2 . 0 34 0324 8 941 98 0 6e+00 , 8 . 37518 04 6127 87 64e-02 ] )  . 
switch {raf {2,2,3) , unfixed,  [1,2,3],  [ 3 . 351 64 1 677 91 72 83e-02 , 1 . 8 07 95 968 90 632 84e+01 , 2 . 027 9387 15 90 4 647e+00 ] )  . 
switch {raf {2,2,2) , unfixed, [1,2,3], [ 3 . 0 91 634 980 61 68 0 6e+00 , 5 . 8 058358 14 4027 66e+01 , 1 . 0 18 73 97 7 6 6328 71e+00 ] ) . 
switch {raf {2,2,1) , unfixed,  [1,2,3],  [  1 . 4 14 1325 90 10 4125e-01 , 2 . 0 44 00 57 78 01 47 4 9e+00 , 7 . 7 9681 1 6 92 92 1 67 9e-04 ] )  . 
switch {raf {2,1,1) , unfixed,  [1,2,3],  [1.31006 65398877 82e+02,l. 920 55 62 3518 77 34e+02 , 5 . 5 9588 555828 01 8 9e-02 ] )  . 
switch {raf {1, 3, 3) , unfixed, [1,2,3], [ 1 . 7 10 1583 98 81 6957e+02 , 7 . 0 054347 52 8372 83e+00 , 8 . 91552 47 872 6 67 15e-02 ] ) . 
switch {raf {1, 3,2) , unfixed,  [1,2,3],  [ 3 . 9 90 6900 77 94 1 641e+02 , 1 . 50 674 12 31 5980 41e+01 , 1 . 7 31 682 980 632 97 6e-02 ] )  . 
switch {raf {1, 3, 1) , unfixed,  [1,2,3],  [ 8 . 0 962 61227 54 77 82e+00 , 1 . 0 98 50 6550 842028e+00 , 1 . 0 1 9592 947 71 42 85e-01 ] )  . 
switch {raf {1,2, 3) , unfixed, [l,2,3],[1.319073469920082e+03,2. 950 56 98 4 971 68 13e+02 , 3 . 304 022587 04 77 38e+01 ] ) . 
switch {raf {1,2,2) , unfixed,  [1,2,3],  [1 . 522038786515532e+03, 4 . 620 18 6580 8 90551e+02 , 5 . 1 00 1521 12 35 68 64e+01 ] )  . 
switch {raf {1,2, 1) , unfixed,  [1,2,3],  [  1 . 5 90 84 1 67 912 90 34e+02 , 1 . 7 30 5552 43 96360 6e+02 , 9 . 4 0425 69 9304 6322e+01 ] )  . 
switch {raf {1, 1, 1) , unfixed,  [1,2,3],  [2 . 0 10 5601 5727 3320e+02 , 2 . 4 04 1 9 97 71 4318 47e-02 , 1 . 0 6654 1353855648e-01 ] )  . 
switch {pIP2 {3,2,3) , unfixed, [1,2,3], [ 4 . 00 12 0 98 938 1311 Oe+0 0 , 1 . 70 3542 80 6927 80 4e+0 1 , 4 . 80 64 30 60 772 6 62 4e+0 1 ] ) . 
switch {pIP2 {3,2,2) , unfixed, [1,2,3], [1. 910316248831325e+01, 1. 91 4372 83 91 64 78 7e+0 1 , 1 . 10 682 944 66 65312e+0 1 ] ) . 
switch {pIP2 {3,1,3) , unfixed, [1,2,3], [ 3 . 40 558 6 94 97 4 627 8e+0 1 , 2 . 70 37 95 98 47 07 132e+0 1 , 4. 9022 93 91 91 60 90 7e+0 1 ] ) . 
switch {pIP2 {3,1,2) , unfixed, [1,2,3], [ 1 . 0 920 05458339 96 9e+0 3, 4 . 280639217816594e+02, 1 . 00 030 6452511 782e+02] ) . 
switch {pIP2 {2,3,3) , unfixed, [1,2,3], [ 5 . 30 37 950 674 93 67 7e-0 3, 6 . 17 03 60 632 643512e-02 , 2 . 12 11 54 1 9 94 68 8 94e+0 0 ] ) . 
switch {pIP2 {2,2,3) , unfixed, [1,2,3], [ 2 . 0 659 60 56 97 95757e+0 0 , 3 . 030570 5907 1222 8e+0 0 , 5 . 20 4 677 660508 04 9e+0 1 ] ) . 
switch {pIP2 {2,2,2) , unfixed, [1,2,3], [ 1 . 30 742 90 957 48 78 3e+0 1 , 3 . 71 00 93 93 62 92 0 90e+0 1 , 2 . 00 92 532341 58 4 98e+0 1 ] ) . 
switch {pIP2 {2,1,3) , unfixed, [1,2,3], [ 5 . 70 77 3542 48 4 6452e+0 1 , 3. 60 8330 93802334 3e+0 1 , 7 . 300007524174188e+01] ) . 
switch {pIP2 {2,1,2) , unfixed,  [1,2,3],  [  1 . 5630 52 6220 36350e  +  0 3,  5 . 50 058378 55 9357 4e  +  02 ,  1 . 07 033321 033027 7e  +  02] )  . 
switch {pIP2 {1,3,3) , unfixed, [1,2,3], [ 5 . 18 9687 70 42 4521 6e-0 3, 1 . 3902 12 08 91 04 61 4e-0 1 , 2 . 04 13 64 4 6 68 7 9335e+0 0 ] ) . 
switch {pIP2 {1,3,2) , unfixed, [1,2,3], [ 9 . 42 34 44 990123153e-0 3, 2 . 6950 17 99 90 5722 5e-02 , 2 . 0121 54 1 62378 58 4e+0 1 ] ) . 
switch {pIP2 {1,3,1) , unfixed, [1,2,3], [ 4 . 24 638 970 6613352e-02 , 4 . 13 9422 98 90 6342 4e-02 , 1 . 00 04 50 57 4320 960e+02 ] ) . 
switch {pIP2 {1,2,3) , unfixed, [1,2,3], [ 1 . 04 3935 90 0 938 11 Oe+0 0 , 1 . 1124 92 6420 73 998e+0 0 , 1 . 01 3314 54 45 91 72 6e+0 0 ] ) . 
switch {pIP2 {1, 2, 2) , unfixed, [1,2,3], [ 5 . 04 0 9 6583032513 9e-02 , 3 . 04 142 9338 9 91 394e+0 0 , 1 . 034337 8721 658 90e+0 0 ] ) . 
switch {pIP2 {1,2,1) , unfixed, [1,2,3], [ 1 . 00 1347 8374 7 9365e-0 1 , 2 . 2301 9584 60 8311 4e+02 , 3. 902331 9658 36363e+0 1 ] ) . 
switch {pIP2 {1,1,3) , unfixed, [1,2,3], [ 1 . 00 8582 97 6980 83 9e+0 0 , 7 . 048285773743402e+00, 7 . 144418272563652e+00] ) . 
switch {pIP2 {1,1,2) , unfixed, [1,2,3], [ 3 . 4 6037 915 90 40 597e+02 , 6 . 10 360 940 64 1517 Oe+0 1 , 1 . 20 47 44 42 60 00 652e+0 1 ] ) . 
switch {pIP2 {1,1,1) , unfixed, [1,2,3], [ 1 . 2 904 11 94 58 43202e+02 , 8 . 30 9607 52 12 13042e+0 1 , 7 . 02 31 95557334 350e-02 ] ) . 
switch {jnk{ 3, 3,2) , unfixed,  [1,2,3],  [ 8 . 1 70 4304 54 9944 60e-02 , 1 . 024 68 84 3943 92 17e+00 , 2 . 374 3678 3631 772 6e-03 ] )  . 
switch {jnk {3, 2, 3) , unfixed, [1,2,3], [3. 711132573 64 1532e+01, 8. 399533624281141e-02,7.386758653356984e-02]). 
switch {jnk {3, 2, 2) , unfixed,  [1,2,3],  [  1 . 0 131 9 91 1 6552721e+00 , 1 . 0 00 10 38 6824 9331e+00 , 1 . 1 34 6643 67 18 10 37e-02 ] )  . 
switch {jnk {3, 1, 1) , unfixed,  [1,2,3],  [  1 . 0 564381 6584 8 98 9e+00 , 1 . 4 0 67 97 63320 82 7 6e+01 , 8 .602  38288615  6428e+01])  . 
switch {jnk {2, 3, 3) , unfixed,  [1,2,3],  [2. 11447 37 1221 54 70e+00, 7. 6732724 9981 5553e-02 , 2 . 0 0 94 621 3822 8 633e+00 ] )  . 
switch {jnk {2, 3,2) , unfixed,  [1,2,3],  [6. 0 7228 68 0727 9122e+00,  9. 0295600 57 54 8634e+00, 1.0078384 90 50 8448e+01]). 
switch {jnk {2, 3, 1) , unfixed,  [1,2,3],  [  1 . 0 84 37 56590 918 77e+00 , 5 . 1 07 31 05053924 59e-02 , 3 . 2 84 84 7 657 072554e-02 ] )  . 
switch {jnk {2, 2, 3) , unfixed,  [1,2,3],  [  9 . 4 01 72 4721 4028 7 9e+01 ,  6 . 0 42 47 97 71 44 50 67e+00 , 2 . 1 051553361222 4 9e+00 ] )  . 
switch {jnk {2, 2, 2) , unfixed, [1,2,3], [ 1 . 0 07 31 4 624 1 607 50e+01 , 1 . 50 94 62 68 620 74 4 9e+01 , 6 . 0 42 42 31 0 601 6300e+00 ] ) . 
switch {jnk {2, 2, 1) , unfixed,  [1,2,3],  [  1 . 0 0 652 961320 7 685e+00 , 7 . 2 98 8 95301 18 72 80e-02 , 1 . 0 97 58 8 94 61302 Ole-01 ] )  . 
switch {jnk {2, 1, 1) , unfixed,  [1,2,3],  [  1 . 1 50 66 64 44 50 78 75e+02 , 1 . 621 057 90 67 9052 6e+02 , 4 . 4 0 911 4 920 38 50 93e+01 ] )  . 
switch {jnk {1, 3, 3) , unfixed, [l,2,3],[1.190916815111105e+02,5.506560385800171e+01,2.091981171398936e+00]). 
switch {jnk {1, 3,2) , unfixed,  [1,2,3],  [2 . 720 04 77 60 1540 43e+02 , 1 . 1 70 00 05 9973137 6e+02 , 2 . 0 91 5985172 61 623e+00 ] )  . 
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switch { jnk{l, 3, 1) , unfixed,  [1,2,3],  [ 8 . 0 77 91 00 48 07 97 42e+00 , 5 . 3542 64384 1518 52e-02 , 3 . 52022 8501 3631 7 6e-02 ] ) 
switch {jnk {1,2, 3) , unfixed, [1,2,3], [ 1 . 340 10 91 93565301e+03 , 1 . 8 02 04 8 611 54 7423e+02 , 9 . 0 98 32 60 77 8 97324e+00 ] ) 
switch {jnk {1,2, 2) , unfixed,  [1,2,3],  [  1 . 7 02 08 50 13 61 9618e+03 , 3 . 380 0123 6614 64 35e+02 , 2 . 50547 03 90 98 47 96e+01 ] ) 
switch {jnk {1,2, 1) , unfixed,  [1,2,3],  [ 3 . 7 81 03 6522 854555e+02 , 4 . 601735814093122e+01, 3. 02441991403 97 37e+00]) 
switch {jnk {1, 1, 1) , unfixed, [1,2,3], [ 1 . 120 10 8347 30 0325e+02 , 3 . 8 0 61122 70 30 54 65e+01 , 2 . 8 01 08 5501 6554 14e+01 ] ) 
switch {plcg{3, 3) , unfixed, [1,2,3], [ 4 . 0 62373 9304 392 95e+0 0 , 3 . 12 1 674 37 4 90305 6e+0 1 , 4 . 61 50 50 78 01 52 78 5e-0 3] ) . 
switch {plcg{3, 2) , unfixed,  [1,2,3],  [ 2 . 850503 90 6638 14 Oe  +  02 ,  9 . 81 39 63 657 92824 le  +  0 1 , 2 . 17 91 80 63202552 7e  +  0 0 ] )  . 
switch {plcg{3, 1) , unfixed, [1,2,3], [2 . 085593842284205e+00, 4 . 3651 87 1220 98 94 6e-02 , 2 . 035541 3618 92 42 9e+0 0 ] ) . 
switch {plcg{2, 3) , unfixed, [1,2,3], [ 5 . 42 11 930 6 90 92 512e+02 , 2 . 911033672775044e+01, 7 . 7122 51 70 31 99735e-02 ] ) . 
switch {plcg{2, 2) , unfixed, [1,2,3], [ 3 . 71 602 90 6 6902 82 7e+0 3, 9 . 30 38 8224 8 688 838e+0 1 , 7 . 2 92427 98 42 0572 9e-02 ] ) . 
switch {plcg{2, 1) , unfixed, [1,2,3], [ 1 . 04 67 1 914 62 84 04 4e+0 0 , 1 . 12 51 62 630332 833e+0 0 , 2 . 00 94 4 9562128 57 5e+0 1 ] ) . 
switch {plcg{l, 1) , unfixed, [1,2,3], [ 2 . 1220 822 651 66 68 Oe+02 , 2 . 620912551646054e+02, 1 . 00 00 0 62 97 94 9 995e+02 ] ) . 
switch {pKA{3, 3) , unfixed, [l,2,3],[1.446217272069794e-02,2.800429836280441e+01,9.706578321660727e+01]). 
switch {pKA{3,2) , unfixed,  [1,2,3],  [  1 . 4 01 80 04 48 152 959e-01 , 3 . 607 8344 54 87 58 15e+01 , 2 . 8 80 3421 8 61 618 95e+02 ] )  . 
switch {pKA{3, 1) , unfixed, [1,2,3], [2. 0 386909 94 437913e+00, 3. 579598885197188e-02, 3. 138 774652 535936e-02]). 
switch {pKA{2, 3) , unfixed, [l,2,3],[9.000996203824691e+00,3.890280244728064e+02,8.303463308550330e+01]). 
switch {pKA{2,2) , unfixed,  [1,2,3],  [ 4 . 2 80 07 90 534332 36e+02 , 2 . 0 62 02 75 91 8 978 90e+03 , 1 . 380 0 68 9 91 72 65 92e+03 ] )  . 
switch {pKA{2, 1) , unfixed, [1,2,3], [1. 07040117652 1088e+02, 9 . 933 64 3024 07 6355e-02 , 7 . 347 08 857 6 95 6984e-02 ] ) . 
switch {pKA{l, 1) , unfixed,  [1,2,3],  [4 . 910 3942 18 88 11 92e+02 , 2 . 3 92238 9532 941 90e-02 , 7 . 1 1 91 918 78 955555e-02 ] )  . 
switch {p38 {3,3) , unfixed,  [1,2,3],  [1. 76163757342 4001e+02, 4. 1309198 4977 5311e+00, 3. 685 93590651 5314e-02]). 
switch {p38 {3,2) , unfixed,  [1,2,3],  [ 3 . 910 58 04 30 64 07 98e+02 , 2 . 50352 88 74 4 9 67 0 6e+01 , 1 . 0 14 3578 47 34 68 42e+00 ] )  . 
switch {p38 {3,1) , unfixed, [1,2,3], [8. 14537 433950 7320e+00,1.086558533401370e+00,1.059189294959750e-01]). 
switch {p38 {2,3) , unfixed,  [1,2,3],  [1.52 9061531 42 9524e+03, 1.020754013808111e+02, 3.70641844721 3304e+01]). 
switch {p38 {2,2) , unfixed,  [1,2,3],  [2 . 0 6500 78 93 68 3377e+03 , 3 . 1 01 62 7 6 90 10 42 1 6e+01 , 2 . 0257228 54 42 1 6 60e+00 ] )  . 
switch {p38 {2,1) , unfixed, [1,2,3], [ 4 . 2 7020 6724 67 91 93e+02 , 1 . 0 48 75 697518 7821e+00 , 1 . 2 34 92 38 52 0 97 6 69e-02 ] ) . 
switch {p38 {1,1) , unfixed,  [1,2,3],  [1. 780089094 06 602 9e+02, 3.2115903 68 82 8645e+02, 1.010 02 0740993941e+02]). 
switch {pKC {3) , unfixed,  [1,2,3],  [4. 0001159 67 1938 32e+00, 3. 8 50 7713158930 66e+02, 3. 5140977 33 98 9625e+01]). 
switch {pKC (2) , unfixed, [1,2,3], [2 . 2 02 11 554 92 985 99e+01 , 3 . 8 0 904 71 77 5027 7 9e+03 , 5 . 7 10 82 9507 41 38 82e+02 ] ) . 
switch {pKC {1) , unfixed,  [1,2,3],  [5. 740963616 63 61 88e+02, 3. 88682 52 38 351088e-02, 8. 7832168 70598354e-02]). 
switch {mek{3) , unfixed,  [  1 , 2 , 3 ] ,  [  6 . 0 10 54 51 87 64 7 9 92e+02 , 5 . 0 02 4128 91 691504e+00 , 2 . 932 34 41 95 90 4 6 68e-02 ] )  . 
switch {mek{2) , unfixed,  [1,2,3],  [ 4 . 1 08 0 643 90 40 8 638e+03 , 8 . 4 131501 78 52 6675e+01 , 2 . 1 0580 9311 40 3344e+00 ] )  . 
switch {mek{l) , unfixed,  [  1 , 2 , 3 ] ,  [2 . 0 10 91 54 50 67 32 37e+02 , 3 . 2 30 02 38 48 812 932e+02 , 7 . 607 7552 04 78 360 6e+01 ] )  . 
switch {erk{3) , unfixed,  [1,2,3],  [2 . 7 53582588 953640e-02 , 4 . 8 10 62 38 67 50 3612e+02 , 1 . 2 50 64 6310 40 158 6e+02 ] )  . 
switch {erk{2) , unfixed,  [1,2,3],  [1.34650  54  7538  32  42e-01,3.8700  92554  70  32  87e+03,3.241480586358680e+02]). 
switch {erk{l) , unfixed, [1,2,3], [4 . 910 01 10 82 952420e+02 , 1 . 0 70252 6 9612 60 61e+02 , 2 . 0 92 07 14 30 17 5520e+00 ] ) . 
switch {pIP3, unfixed,  [1,2,3],  [ 5 . 7 40 4 640 18 132127e+02 , 4 . 4 02 01 1 90 94 601 51e+03 , 4 . 2 40 6647 67 051123e+02 ] )  . 
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List  of  Symbols,  Abbreviations,  and  Acronyms 


AP 

maximum  a  posteriori 

DAGs 

directed  acyclic  graphs 

DNA 

deoxy  ribonucleic  acid 

GRNs 

Genetic  Regulatory  Networks 

MAP 

maximum  a  posteriori 

ML 

maximum  likelihood 

mRNA 

messenger  ribooxy  nucleic  acid 

VB-EM 

Variational  Bayes-Expectation  Maximization 
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